Gene expression data preprocessing

نویسندگان
چکیده

منابع مشابه

Gene expression data preprocessing

We present an interactive web tool for preprocessing microarray gene expression data. It analyses the data, suggests the most appropriate transformations and proceeds with them after user agreement. The normal preprocessing steps include scale transformations, management of missing values, replicate handling, flat pattern filtering and pattern standardization and they are required before perfor...

متن کامل

Evaluation of Clustering Based on Preprocessing in Gene Expression Data

Microarrays have become the effective, broadly used tools in biological and medical research to address a wide range of problems, including classification of disease subtypes and tumors. Many statistical methods are available for analyzing and systematizing these complex data into meaningful information, and one of the main goals in analyzing gene expression data is the detection of samples or ...

متن کامل

Preprocessing of gene-expression data related to breast cancer diagnosis

The work is performed in close cooperation with the University of Tromsø and professor Eiliv Lund and is financed by the ERC TICE project. This note describes the preprocessing steps of gene expression data and focuses particularly on the filtering and normalization steps as the choices made here greatly affects the set of probes used in later analyses. In the filtering step, two parameters are...

متن کامل

Data Integration: an Approach to Improve the Preprocessing and Analysis of Gene Expression Data

The integration and evaluation of data from multiple DNA microarray datasets for a specific analysis is an important and yet challenging problem. In contrast to the majority of studies, which are focused on a particular biological problem, the present paper examines how the combination of several related microarray datasets affects different areas of preprocessing and analysis of gene expressio...

متن کامل

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bioinformatics

سال: 2003

ISSN: 1367-4803,1460-2059

DOI: 10.1093/bioinformatics/btg040